Corpus: deu-eu_web_2014_100K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at sentence endings, counted over the first K sentences

      K  # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
    100          97            98             99            99            99
   1000         830           985            996           998           999
  10000        5714          9322           9861          9940          9971
 100000       34319         83163          96875         98940         99437
1000000       34319         83164          96876         98941         99438
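
A minimal sketch of how counts like these could be computed. It is not the tool that produced the table. It assumes that each sentence is a string of whitespace-separated tokens with final punctuation already removed, that the "N-gram at the sentence ending" is the last N tokens of a sentence, and that the table reports distinct N-grams. The function names are hypothetical.

```python
from typing import Dict, List, Optional, Tuple


def ending_ngram(tokens: List[str], n: int) -> Optional[Tuple[str, ...]]:
    """Return the last n tokens as a tuple, or None if the sentence is too short."""
    if len(tokens) < n:
        return None
    return tuple(tokens[-n:])


def count_distinct_ending_ngrams(
    sentences: List[str], k: int, max_n: int = 5
) -> Dict[int, int]:
    """Count distinct sentence-final N-grams (N = 1..max_n) in the first k sentences."""
    seen = {n: set() for n in range(1, max_n + 1)}
    # Slicing is safe when k exceeds the number of sentences: all are used.
    for sentence in sentences[:k]:
        tokens = sentence.split()  # assumption: whitespace tokenization
        for n in range(1, max_n + 1):
            gram = ending_ngram(tokens, n)
            if gram is not None:
                seen[n].add(gram)
    return {n: len(grams) for n, grams in seen.items()}


if __name__ == "__main__":
    toy = ["das ist gut", "es ist gut", "wir gehen heim"]
    print(count_distinct_ending_ngrams(toy, k=3))
```

In the toy input, two sentences end in "gut" and in "ist gut", so the unigram and bigram counts are both 2, while all three trigrams differ. Longer N-grams are distinct more often, which matches the table's pattern of counts approaching K as N grows.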


[Figure: Zipf diagram for sentence endings (Gnuplot plot)]
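
A Zipf diagram plots frequency against rank, usually on log-log axes. The sketch below shows one way the underlying rank-frequency data for sentence-final words might be derived. It assumes whitespace-tokenized sentences without final punctuation, and the function name is hypothetical. It is not the procedure used for the original plot.

```python
from collections import Counter
from typing import List, Tuple


def zipf_points(sentences: List[str]) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final words, most frequent first."""
    final_words = []
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenization
        if tokens:  # skip empty sentences
            final_words.append(tokens[-1])
    frequencies = sorted(Counter(final_words).values(), reverse=True)
    return list(enumerate(frequencies, start=1))


if __name__ == "__main__":
    toy = ["das ist gut", "es ist gut", "wir gehen heim"]
    # Tab-separated rank and frequency, one pair per line.
    for rank, freq in zipf_points(toy):
        print(f"{rank}\t{freq}")
```

The tab-separated output is a two-column data file that a plotting tool can read, with both axes set to logarithmic scale.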

6040 msec needed at 2018-04-10 11:46